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ABSTRACT 

We present a large X-ray selected serendipitous cluster survey based on a novel joint analysis of 
archival Chandra and XMM- Newton data. The survey provides enough depth to reach clusters of flux 
of ~ 10 -14 ergs cm -2 s _1 near z ~ 1 and simultaneously a large enough sample to find evidence for 
the strong evolution of clusters expected from structure formation theory. We detected a total of 723 
clusters of which 462 are newly discovered clusters with greater than 6a significance. In addition, we 
also detect and measure 261 previously-known clusters and groups that can be used to calibrate the 
survey. The survey exploits a technique which combines the exquisite Chandra imaging quality with 
the high throughput of the XMM- Newton telescopes using overlapping survey regions. A large fraction 
of the contamination from AGN point sources is mitigated by using this technique. This results in a 
higher sensitivity for finding clusters of galaxies with relatively few photons and a large part of our 
survey has a flux sensitivity between 10 -14 and 10 -15 ergs cm -2 s _1 . The survey covers 41.2 square 
degrees of overlapping Chandra and XMM- Newton fields and 122.2 square degrees of non-overlapping 
Chandra data. We measure the log N-log S distribution and fit it with a redshift-dependent model 

_ z 

characterized by a luminosity distribution proportional to e z ° . We find that zq to be in the range 
0.7 to 1.3, indicative of rapid cluster evolution, as expected for cosmic structure formation using 
parameters appropriate to the concordance cosmological model. Confirmation of our cluster detection 
efficiency through optical follow-up studies currently in progress will help to strengthen this conclusion 
and eventually allow to use these data to derive tight contraints on cosmological parameters. 
Subject headings: cosmologyiobservations — galaxy: clusters — X-rays:galaxies:clusters 



1. INTRODUCTION 

There is a long history of compiling catalog s of nearby 
galaxy clusters by using optical telescopes jAbell 1958; 
IZwickv et al.|[l96l| ). The development of focusing X-ray 
telescopes a nd the early discover y of X-rays from clusters 
of galaxies (|Bvram et al.l Il966l ) provided another sen- 
sitive method for finding clusters of galaxies. A num- 
ber of X-ray catalogues now exist for the brightest clus- 
ters. Most of the nearby clusters are known from flux- 
limited surveys that cover a large fraction of the sky 
from ROSA T and Einstein down to 10 -12 e rgs cm" 2 s -1 
(REF LEX, iBohringer efaTl 120041 : BCS, lEbeling et all 
119981 : EMSS. lGioia et al.lll990D ~These surveys have dis- 
covered 100-450 clusters. Deeper surveys that extend 
to 10 -14 ergs cm -2 s _1 and therefore probe a larger 
range of redshifts have found 60-200 cluster s serendipi- 
tously (160 square deg.JVikhlinin et al.lfl998l: 400 square 
deg. iBurenin et al l [20071 : ROSAT NEP. iGioia et al l 
I 2001L iHenrv et al. 1 12006: WARPS, IScharf et all I1997L 
iHorner et al. 1 120081 ). XMM- Newton surveys have found 
19 clusters serendipitously (|Georgantopoulos et al.l l2005) 
and 12 clusters in the Large Scale Structure Survey 
(| Willis et al.| [2QQ5) down to deeper flux levels. 

The slope of the log N-log S distribution for clusters is 
close to -1 down to a flux limit of « 10 -13 ergs cm -2 s _1 
which means that a wide survey with short exposures 
will approximately detect the same number of clusters 
as a narrow survey with longer exposures for the same 



total exposure time. Thus the total number of clusters 
is simply proportional to the product of a telescope's 
etendue (effective area times field of view) times the to- 
tal survey exposure time divided by the minimum num- 
ber of photons that one can reliably use to identify a 
cluster unambiguously. A large number of the previ- 
ous surveys used ROSAT which has an area of 200 cm 2 
and a field of view of 3 square degrees. The fields of 
view of Chandra (0.1 square degrees) and XMM-Newton 
(0.2 square degrees) are not particularly high, but the 
effective area of Chandra (400 cm 2 at 1 keV) and es- 
pecially XMM-Newton (2000 cm 2 at 1 keV) are high. 
XMM-Newton, in particular, has an etendue compara- 
ble to ROSAT. One difficulty with using XMM-Newton 
to perform cluster surveys compared to ROSAT is that 
the large area and smaller field of view will collect on 
average more distant groups of galaxies, with smaller 
angular size, because the survey will be deeper rather 
than wider for the same exposure time. The fact that 
the clusters have smaller angular size means that it is 
harder to determine the difference between a cluster or 
group and a point source. On the other hand, Chan- 
dra has a exquisitely sharp point-spread function (PSF) 
that distiguishes AGN point sources from clusters at all 
redshifts across its entire field of view. Therefore, by 
implementing an analysis method which combines the 
Chandra PSF with the XMM-Newton throughput, we 
demonstrate that we can produce a large sensitive X-ray 
cluster survey. 
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ods have converged on a standard cosmological model, 
which indicates the presence of a poorly understood dark 
energy component. An extensive search for new galaxy 
clusters will provide new constraints on cosmological 
models. In general, the abundance and distribution of 
clusters of galaxies in the Universe provides precision and 
com plementary cosm ological constraints to other meth- 
ods (|Eke et al.lll99 6). In particular, the measurements 
are most sensitive to <78^ m , where o~s is the normalization 
of the matter power spectrum on 8 Mpc scales and Qm is 
the matter density. A large enough sample at a range of 
redshifts can constrain w, the equation of state parame- 
ter fo r dark energy (e.g.. lHaiman et aHl200ll . lWang et al.l 
2004). The distribution of clusters as a function of mass 
and the spatial power spectrum constrains cosmological 
parameters as well. 

The content of this initial paper principally concerns 
the exposition of the new approach for detecting a large 
sample of X-ray clusters based on a combined analysis of 
the all useful data in the full Chandra and XMM- Newton 
archives. We have discovered a new unbiased sample of 
462 clusters that reaches the most sensitive flux limit 
achieved to date ( 10 -14 ergs cm -2 s _1 ). We summa- 
rize the cosmological implications of this new information 
with an initial limited approach that assumes the concor- 
dance model of the universe is correct and compares the 
measured Log N-Log S curve of the new X-ray cluster 
sample with what would be expected in a concordance 
Universe. The limited scope of this approach precludes a 
full analysis of the consistency of these new data and the 
concordance model. We do confirm approximate consis- 
tency with the concordance model in two independent 
ways. First we show that the rapid deviation of the Log 
N-Log S curve for fluxes less than ( 10 -13 ergs cm -2 s _1 ) 
for a universe without evolution of the density of clus- 
ters is consistent with an expected exponential decrease 
in cluster density for clusters near z « 1 and beyond. 
This result approximately matches the expected density 
for clusters of mass range expected at a distance of z 
~ 1 (see figure 15). Secondly, assuming a concordance 
model, we compute the expected Log N-Log S curve and 
find approximate agreement with the newly measured 
Log N-Log S curve (see figure 16). The detailed expla- 
nation of these two comparisons assuming a concordance 
model appears later in this paper. A full analysis beyond 
approximate consistency with the concordance model is 
beyond the scope of this initial paper. In particular, 
we must improve estimates of several possible system- 
atic errors including the volume of the surveyed region 
in three dimensions, the exact detection threshold near 
the flux limit, the re-calibration of the temperature, size, 
luminosity and mass relations of X-ray selected clusters. 
We present the log N-log S of the unbiased sample of 
462 new clusters without redshift measurements because 
redshifts (and thus luminosities) are not needed for this 
initial simple analysis. Future work will address the full 
cosmological implications of these measurements beyond 
just the Log N-log S curve for clusters. 

2. BASIC METHODOLOGY 

X-ray observations that have both XMM- Newton and 
Chandra data for the same piece of the sky have the ad- 
vantage that they can use both the Chandra PSF and the 
XMM-Newton throughput. For example, finding three 



photons within one or two arcseconds of each other in 
a 5 ks Chandra exposure can umambiguously identify 
an AGN. The probability of background fluctuation or a 
diffuse source creating that situation is negligible. The 
number of photons required to determine a point source 
in XMM-Newton is much higher. 

On the other hand, XMM-Newton has ~5 times the 
effective area and twice the field of view of the Chandra 
telescope. So over both missions' lifetimes XMM-Newton 
has collected 10 times as many photons as Chandra from 
clusters. Therefore, by simply using Chandra data to find 
the positions of AGNs, we can subtract photons from the 
XMM-Newton data and select clusters with as few as 10 
to 15 photons. We therefore have a sensitivity several 
times higher than if we used XMM-Newton data alone. 
It may be possible to use the full XMM-Newton dataset 
after calibrating the AGN contribution precisely. The 
major challenge is to learn how to combine and calibrate 
all the data and correct for the non-uniformity of the 
survey. New analysis techniques are required, since the 
exposures are very deep and contain many smaller clus- 
ters at high redshift. 

3. CLUSTER SELECTION PROCEDURE 

Following we describe the basic data processing, im- 
age reconstruction, point source removal procedure, and 
source detection and selection methods. 

3.1. Basic Data Processing 

To start this project, we first scanned the XMM- 
Newton archive for any fields that could potentially over- 
lap with Chandra fields for all data in either archive as of 
September 2006. Using these fields we can combine the 
advantages of the Chandra PSF and the XMM-Newton 
throughput in the following method. A remarkably large 
fraction (nearly half) of the observations overlap with 
Chandra observations in at least some part of their field 
of view. This project clearly benefits from the fact that 
X-ray observers tend to choose similar targets with both 
observatories. We downloaded 1201 XMM-Newton fields 
that overlapped with the the 2706 Chandra ACIS-S or 
ACIS-I fields. We excluded fields within 10 degrees of 
the Galactic plane or within 5 degrees of the SMC and 
LMC. We also included fields that have only Chandra 
data, since although they will not benefit from the XMM- 
Newton throughput they can be used to augment the sur- 
vey with more data with high PSF quality. The positions 
of the observations are shown in Figure 1. The survey 
contains approximately 122.2 square deg. of Chandra- 
alone data with an average exposure of 44 ks and 41.2 
square deg. of overlapping XMM-Newton and Chandra 
data with an equivalent Chandra exposure of 166 ks. 
Thus, the Chandra-alone data contains about 44% of the 
cluster candidates and the overlapping survey will con- 
tain about 56% because of the relative effective etendues 
of each sub-survey. A total of 54 Ms of Chandra data 
and 7 Ms of XMM-Newton data were used. 

We used the Chandra level 2 events and applied the 
standard destreak processing tool. We processed all the 
XMM-Newton data using the SAS pipeline. The Chan- 
dra photon events were selected to have corrected energy 
(PI) between 0.5 and 7 keV, have no bad data quality 
flag, and have event pattern less than or equal to 4. The 
XMM-Newton photon events were selected to have cor- 
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Fig. 1. — A map of the survey in equatorial coordinates. The blue regions mark areas where there is either Chandra data or overlapping 
Chandra and XMM- Newton data. Approximately 0.4% of the sky is covered by this survey. Notably absent is the galactic plane data. The 
largest contiguous patch in the upper left is the Bootes Chandra survey. 



rected energy (PI) between 0.5 and 7 keV and an event 
grade of 0, 3, or 4. Background flares were removed from 
all the 3907 datasets by first binning the events in 100 
second time bins. We then iteratively removed 100 sec- 
ond intervals which have the highest count rate until the 
signal squared (proportional to the number of time bins) 
divided by the noise (proportional to the total counts) 
was maximized. This procedure optimally removed the 
variable component of the background (or a bright flar- 
ing AGN). We also removed observations with effective 
exposures (in ks) divided by the count rate greater than 
0.2. This removed extremely short exposures that almost 
entirely contain background flares. 

3.2. Image Reconstruction 

We scanned the entire sky and identified all photons 
from any of the datasets that could have originated in 
a given grid cell of size 1 square degree. On average 
there are usually about 2 different observations from ei- 
ther telescope that overlap with a given field position. 
Within each 1 square degree we only include cluster can- 
didates in the catalog if they fall within the central 0.5 
degree by 0.5 degree box, but we analyzed the entire set 
of photons in the box through the procedure we describe 
below. The 1 degree square regions were chosen, how- 
ever, so that they overlapped by 0.5 degrees, leaving no 
residual binning artifacts. 

We only included events from the Chandra data that 
were within 15 arcminutes of the center of the field of 
view. This was necessary because the size of the PSF 
increases significantly at larger angles, but still a large 
fraction of the data were included. This angle cut also 
means that some ACIS-S data was included with ACIS- 
I pointed observations and some ACIS-I data were in- 
cluded with ACIS-S pointed observations. For some par- 
ticularly deep observations with long exposures, we only 
included a randomly chosen set of 10 T photons and then 
scaled the measurements accordingly, since some of the 
algorithms we employ below have run times proportional 



to the number of photons squared. This was rare and 
only occurs for less than 1% of the datasets. 

For every arcminute of the sky, we computed the effec- 
tive exposure times telescope area (having units cm 2 s) 
including the effects of telescope vignetting. To do this, 
we assume that the ratio of effective areas of the ACIS- 
S, EPIC-PN, and EPIC-MOS were, respectively, 1.4, 4.8, 
and 1.6 times the ACIS-I area and then we scaled all re- 
sults of the effective area exposure map to ACIS-I. In re- 
ality, the relative area depends on the area of the source, 
but the values we found were an empirically best av- 
erages over all sources and background. Similarly, for 
the vignetting profile, we assumed a Lorentzian profile 
of the form 1/(1 + (0/0o) 2 ) where 0o is 15 arcminutes 
for XMM- Newton and 25 arcminutes for Chandra. This 
roughly matched the average energy dependence of the 
vignetting. 

We then removed XMM-Newton photons and set the 
XMM-Newton exposure-area to zero where there were no 
Chandra data in either that location in the 1 arcminute 
exposure-area map or in the adjacent 4 squares in the 1 
arcminute exposure- area map. This then conservatively 
removed any edge effects, and we are left with XMM- 
Newton events only where there is a complete set of 
Chandra events. A sample from the Lynx field is shown 
in Figure 2 after this step where there is a deep expo- 
sure with both Chandra and XMM-Newton. After this 
calculation, we have an estimate of the exposure-area for 
every square arcminute of the entire sky. For each ar- 
cminute of the sky, we also computed the distance to the 
aimpoint of any exposure for both Chandra and XMM- 
Newton exposures, in order to compare our candidate 
sources with an estimate of the PSF size. We also esti- 
mated the average count rate for each square degree of 
the sky for background estimation. 

3.3. Point Source Removal 

Using the Chandra photons alone we located the point 
sources by finding photons where the number of neigh- 
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Fig. 2. — A Chandra (left) and XMM-Newton (right) exposure of the Lynx field after the non-overlapping XMM- Newton data has been 
removed. The Chandra field is mostly an ACIS-I pointing and a small piece of the ACIS-S array on the left. This is an example of a deep 
exposure in the survey with both XMM-Newton and Chandra data. This pixel scale is 5 arcseconds, and the brightest regions correspond 
to roughly 100 counts per pixel. 






Fig. 3. — The upper left panel shows the AGN photons removed from the Chandra data using the method described in the text. The 
lower left panel contains the cluster candidate emission and background events. The upper right panel shows the corresponding XMM- 
Newton photons that were subtracted because they were at close to the same position as the Chandra data. The lower right panel shows 
the XMM-Newton photons after subtracted. The subtraction is not perfect, since the AGN sources vary and the method may over or 
undersubtract some AGN. However, it is clear most of the AGN emission complicating the source finding in the right in Figure 2 has been 
removed. 



bors within the local PSF size exceeded the number of 
photons within twice the local PSF size by two sigma. 
We only searched through a subset of photons in the rel- 
evant 1 by 1 arcminute block or the adjacent 1 by 1 ar- 
cminute block. This improved the computation efficiency 
of the algorithm. The local PSF size was estimated to be 
(3 + 20(^) 2 ) arcseconds where 0\ is 15 arcminutes. Pho- 
tons which satisfied this condition held were tagged as 
candidate "AGN" photons. We also randomly untagged 
a fraction of these photons in proportion to the relative 



count rate of photons with a local PSF size to twice a lo- 
cal PSF size. This allowed us to fill back in some photons 
which may be due to background. All Chandra photons 
were then tagged as either "AGN" photons like those in 
the top left panel in Figure 3 or "Cluster+Background" 
photons like those in the lower left panel in Figure 3. 

Then for each Chandra photon we removed a set of 
the nearest XMM-Newton photons (when there are any) 
given by the ratio of the exposure-area maps we com- 
puted earlier. The nearest XMM-Newton photon was 
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Fig. 4. — The cluster candidate and background photons obtained by adding the left images above is shown on the left. Similarly, the 
co-added AGN emission is shown on the right. The cluster and AGN candidates are then found on these maps using the wavelet method 
described in the text. The candidates are circled with a size proportional to the size determined from the wavelet methods. The thicker 
circles correspond to those that satisfy all remaining data cuts. In this case, the two prominent clusters near the center are selected. 




Fig. 5. — The source flux of the simulated clusters compared to the measured flux using the source detection algorithm as described in the 
text. Each point represents a different simulated cluster with a different background level (diamonds=10 - , squares^ 1CT 4 , triangles=lCr 2 , 
x sign=10°, and plus sign=10 2 counts per pixel). The source flux is the number of photons we put in image, where as the measured flux 
is the source counts estimated by the wavelet method. The source size is the core radius of the beta model and the measured size is the 
most significant size of the wavelet method times the correction factor. 



found by computing, 



Ax 2 + Ay 2 Ae 2 
V ( 10 ") 2 + (150eVy^7) 2 

where Ax and Ay are the difference in spatial coordi- 
nates, Ae is the difference in measured photon energy 
(in eV), and Ck e v is the photon energy. Thus, the se- 
lected photons were chosen to have a similar position 
and have similar energy. If the ratio of exposure-area 
maps was not an integer, we chose random numbers ap- 
propriately to decide whether to add extra photons to 
obtain the proper normalization. We also did not use 
XMM- Newton photons outside of a 2 by 2 arcminute 
block as with the previous step for computational effi- 
ciency. This method then selected the XMM- Newton 
AGN photons even if the XMM-Newton PSF was large 
and if it was difficult to determine if the source was a 



cluster or AGN from the XMM-Newton data alone. We 
then have two sets of photons for the XMM-Newton data 
as well: those that are removed are composed primarily 
of AGN photons (top right in Figure 3), and those that 
have not been removed that are either from clusters or 
background (bottom right in Figure 3). The method is 
imperfect when the AGN flux varies significantly between 
observations and due to noise, but we can estimate this 
by comparing the two maps. If we subtracted a lot of 
"AGN" photons at a particular location and we found a 
source in the subtracted map then this was likely due to 
a subtraction problem. This is described in more detail 
in the next section. 

3.4. Source Detection and Selection 

Next, we constructed the combined maps by adding the 
XMM-Newton and Chandra maps for AGN photons and 
clusters and background photons, respectively as shown 
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Fig. 6. — The full catalog of cluster candidates (continued on next pages). The subtracted AGN emission is shown in red, whereas the 
cluster emission is shown in blue. The dotted circle represents twice the fitted wavelet size of the source detection algorithm. 



in Figure 4. We only included photons from 0.5 to 2 
keV, since now we wanted to maximize our sensitivity 
to finding new clusters and most of the cluster emission 
occurs in the soft band. We binned the photons into a 
5 arcsecond grid and then find sources using a wavelet 
method which is a modified form of that described by 
Vikhlinin 1998. To do this, we computed functions of 
the following form: 



](x ,y Ch cr) 



1 



2ttg 2 



f(x,y)e~ 



xo) z + (y-yo) z 



where g is a gaussian convolution of the map / with scale 
a. These convolutions can be calculated most efficiently 
in Fourier space. An estimate of the number of photons 
of a given source at position (xq, yo) of size g is given by 



c(x,y) 



3(3:0, 2/o, cr) 
x 0o,2/o,cr) 



98c(xo,yo,2a) \ 

#eaOo,2/0,2<T) / 



where the g sc is the integral above using the soft band 
counts and g ea is the integral using the exposure-area 
function. When the area-exposure map is constant this 
function will be equivalent to the normal spherical top- 
hat function used in wavelet applications. We stepped 
through every pixel of the map using source sizes from 1 
to 32 pixels in step size of log2 (radius) of 0.2. We also 
estimate the hard band counts (2.0 to 7.0 keV) and the 
AGN soft band counts at the same position. The back- 
ground rate was estimated locally by convolving a gaus- 
sian with g of 480 arcseconds (3 times the largest source 
we allow). We found that the detection significance of a 
source could then be estimated by the formula 



significance = — / 



4(bG 2 ) 



-OA 



where c is the counts function defined in the previous 
equation, b is the background function using the previ- 
ous equation with width of 480 arcseconds, and n is a 
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Fig. 7. — The full catalog of cluster candidates (continued). The subtracted AGN emission is shown in red, whereas the cluster emission 
is shown in blue. 
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Fig. 8. — The full catalog of cluster candidates (continued). The subtracted AGN emission is shown in red, whereas the cluster emission 
is shown in blue. 



normalizing factor 8Q ^ . 2 • We determined this function 
by performing Monte Carlo simulations of pure poisson 
noise. The first term dominates for high count rates and 
follows the gaussian form, and the second term corrects 
for the fact that at low count rates the distribution is 
non-gaussian. The detection significance, therefore, de- 
pends both on the number of photons in the source and 
the source size. We also found it necessary to calculate 
the signal to noise as well as the source significance (sig- 
nal divided by the square root of noise), because there 
were highly significant large sources that had low signal 
to noise. We then have a set of candidate sources at 
various positions and sizes. We only select sources with 
significances greater than 6 sigma and signal to noise 
greater than one. These limits were chosen so that we at 



most have one spurious source in the entire survey due 
to background fluctuation and do not include low sig- 
nal to noise sources that may be due to non-statistical 
background calibration errors. 

At this point, the candidate sources could overlap and 
be at adjacent locations because they all have enough 
detection significance. To select a unique set of sources, 
we simply remove any candidate source that is within 
3 source radii from another and has lower significance. 
The source radius is defined as the maximum a of the 
two sources being compared. We found that this visu- 
ally results in only one object being detected per actual 
source quite efficiently. 

To test the ability to correctly measure the flux and 
size of sources, we performed a series of photon Monte 
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Fig. 9. — The full catalog of cluster candidates (continued). The subtracted AGN emission is shown in red, whereas the cluster emission 
is shown in blue. 




Fig. 10. — The full catalog of cluster candidates (continued). The subtracted AGN emission is shown in red, whereas the cluster emission 
is shown in blue. 



Carlo simulations. We first randomly generated between 
30 and 10,000 photons representing a cluster using a (3 
model. The core radius was randomly chosen between 2 
and 10 pixels and (3 was chosen between 0.5 and 0.75. 
We then chose a background level randomly between 
10 -6 and 10 2 counts per pixel having Poisson noise. This 
simulates the wide range of sources and conditions that 
we might find. We then used the source finding algo- 
rithm on the simulated image and found that the source 
flux was accurately recovered and the source size was ac- 
curately measured even in the presence of the different 
background conditions. This is shown in Figure [5l The 
typical scatter from the true values was 20% and 30% for 
the source size and flux, respectively. We also note that 
we recovered 495 out of the 500 sources we simulated. 



This indicates that our source finding algorithm is fairly 
robust to a range of background levels and source fluxes. 

We then have a large number of cluster candidates. We 
found it necessary to employ four further selection crite- 
ria to remove non-clusters. First, we computed the ratio 
of the counts in the cluster candidate to the counts in 
the AGN map. We required this ratio to be greater than 
4. Without this requirement, AGN which have increased 
flux between the Chandra and XMM- Newton exposures 
will be undersubtracted. This is rare since the variabil- 
ity is typically only about 10%, but there are also many 
AGN candidates. 

We also found it necessary to require that the source 
size plus three times the measurement error is at least 4 
arcseconds larger than the local PSF size, 
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v /(size + 3 ( T size )2-(PSF)2>4" 

This is a very conservative cut that removes about 30% 
of the potential candidates. Since, we have added three 
times the source size error, we are only effectively re- 
moving candidates that have very little chance of actu- 
ally being legitimate clusters. We then expect that only 
a small fraction of this 30% would be real clusters or 
groups that are removed accidentally. The local PSF size 
is estimated by combining quadratically the estimate for 
the local Chandra PSF given in the previous section with 
the XMM- Newton PSF in proportion to their exposure 
area. The XMM-Newton PSF is assumed to be 10 arc- 
seconds across the field of view, since it changes shape 
but its size does not change dramatically. The size error 
is determined by simply dividing the size by the square 
root of the number of counts. Thus, by using the AGN 
photon subtraction method above we have removed the 
vast majority of AGN emission but there is some residual 
background due to AGN variability. Although this lim- 
its our ultimate sensitivity, our method is several times 
more sensitive than just finding sources and estimating 
the size without the photon subtraction. 

We also estimate the ellipticity of the source by cal- 
culating the second moment of the counts weighted by 
a gaussian with width equal to the size of the source. 
We then require the ellipticity to be less than 0.25. This 
does not remove any clusters, but removes some spurious 
sources due to the readout streak. 

We also removed cluster candidates where the ratio of 
counts in the soft band over the counts in the hard band 
(the spectral softness, s) follows the equation, 

S - 3(7 J > 1 

This cut cleanly removes some systematic fluctuations 
in the background. The softness of the background is 
defined as exactly 1, since we set the middle our bands 
to be the median of the energy as described previously. 
For XMM-Newton this results in energy range of 0.5 to 
1.7 keV for the soft band and 1.7 to 7.0 keV for the hard 
band. For Chandra this results in an energy range of 0.5 
to 2.2 keV for the soft band and 2.2 to 7.0 keV for the 
hard bands. A high temperature, ~ 8/ceV, cluster has 
a softness near 2.5, however, and a lower temperature 
cluster has a much higher spectral softness. 

Finally, we found it necessary for some of the final can- 
didates to be removed by human inspection, since they 
were obvious super novae remnants, planets, or knots in 
emission that had survived all of our cluster selection 
procedures. We utilized a 4 node computing cluster to 
perform the calculations for this procedure on all the data 
sets in about a week, and perfected it after many iter- 
ations. After all the data selection procedures, we have 
1198 cluster candidates as shown in Figure 6. Many of 
these clusters were known sources and were in fact the 
targets of the relevant observation. There are 462 clus- 
ters that are outside of 4 arcminutes from the central 
pointing. Later we will argue on the basis of the log N- 
log S distribution that this is a reliable estimate of the 
number of new clusters. 



After all these cuts we were able to use the AGN in our 
sample to estimate how many of these objects would sur- 
vive if they were incorrectly chosen as cluster candidates 
due to some mistake in the photon removal procedure. 
We find that 19 AGN would survive these cuts that were 
more than 4 arcminutes outside the central pointing. 
Thus, we estimate that the contamination in our survey 
of false clusters is about 3%. We also need to estimate 
the efficiency of not removing real clusters. We have, 
however, performed Monte Carlo simulations that found 
99% of all of our simulated clusters. We note, however, 
that the cuts we have used above are designed to only 
remove objects that have completely inconsistent prop- 
erties of a normal cluster. The one cut to worry about 
in particular might be the size cut, which could remove 
some particularly small groups. This cut, however, only 
removes 30% of the candidates so even if some of these 
were real clusters it probably would not be much more 
than 3 to 5 %. Furthermore, none of the cuts remove any 
of the brightest 100 already well-known clusters. Thus, 
we estimate that the efficiency of these candidate cuts is 
around 95% and that might roughly cancel with the 3% 
estimation for contamination. Both of these estimates 
can be constrained more rigorously with simulations and 
optical follow-up. 

4. COMPARISON WITH PREVIOUSLY KNOWN CLUSTERS 

Since we have used archival observations, a significant 
fraction of our sample will be comprised of previously 
known clusters that were the targets of their respective 
observations. In order to compare the cluster candi- 
dates with previously known clusters, we used the NASA 
Extragalactic Database (NED). NED currently contains 
over 40,000 cluster candidates from over 300 surveys. In 
order to check our cluster candidates reliably, we decon- 
structed the NED catalog into the individual cluster cat- 
alogs. We then cross-correlated that catalog with itself 
to select objects within 1 arcminute of each other. A rea- 
sonable way to be confident that the cluster candidates 
are real is to simply require that the object appear in 
multiple catalogs whether they are X-ray surveys, opti- 
cal, or any other kind of survey. We found that 261 (out 
of 1198 that passed our selection cuts) clusters in our 
survey were previously known. There was also a signifi- 
cant number of sources at the center of the field of view 
that were nearby galaxies and other extended sources. 

5. LOG N-LOG S DISTRIBUTION 

We constructed the log N-log S distribution for the 
clusters in our sample as shown in Figure 11. We plot- 
ted the log N-log S distribution in both the cumulative 
and differential forms (Figure 12). The conversion from 
counts to flux was calculated by using WebPIMMS using 
a column density of 2 x 10 20 cm -2 , kT = 4keV and abun- 
dance of 0.4 solar. Although there is obviously variation 
in our clusters, the variation will be less than our flux bin 
width and these are expected typical values. The log N- 
log S distributions are then calculated by adding up the 
number of sources divided by the area of the survey that 
source in which that source could have been detected. 
Since we have a different flux limit for every point in the 
survey, we had to take this into account. Furthermore, 
our source detection sensitivity depends on the object's 
measured wavelet size. Therefore, we calculated the sur- 
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Fig. 11. — The log N-log S distribution for all the cluster candidates (black), the XMM-Newton/Chandra overlap candidates (red 
diamonds), and the Chandra only candidates (green diamonds). The models are (black lines from top to bottom): no evolution with 



: 2 x 10 41 ergs s" 



2.7 x lCT 7 Mpc- 3 , no evolution with a = 1.6, L* 71 



2 x 1 Q 42 , = 1 .8 X 10~ 7 , and evolution 



a = 1.7, LJ m * n : 

mode ls with zp = 0.7, 1.0, 1.3 using t he latter se t of parameters (see text). The blu e lines are those of j Bauer 2002| (ci rcle), [ Giacconi et al. 
2002 (square), McCard v eFaTI fl998l (triangle), Hen ry et al~|[200ll f dashed b ox), Ros ait et aIH l998 (dotted box), [Vikhlinin et al. 1998 
(solid box), Bohringer et al. 2001 (solid line on right), and lEbeling et al.H1998l (dotted line on right). 



vey area as a function of flux for each object taking into 
account our source selection function in Figure 13. For 
each square arcminute of the survey we calculate the to- 
tal exposure and multiply that by a given flux. This gives 
us an estimate of the number of photons that would be 
produced by a source at that particular place in the sky. 
Then we see if that would pass both our significance cut 
described in Section 3.4 and our signal to noise cut for the 
background at that particular place in the field. Thus, at 
high source fluxes the survey area is the full survey, but 
at low source flux it will be lower. We have assumed that 
our other selection cuts do not remove significant num- 
bers of real clusters and largely remove spurious data 
artifacts. Our earlier Monte Carlo simulations indicate 
for a wide range of sources that most of these (99%) are 
recovered by the source finding algorithm. 



We examined the subset of sources located a distance 
A from the centers of observations in which they were 
detected, and varied A until the log N-log S distribu- 
tion convered. This occurs for A ~ 4 arcminutes. We 
take this as an indication that the application of that 
cut removes the bias associated with selected pointings 
at known clusters. We also separated the Log N-Log 
S points for the overlapping Chandra and XMM- Newton 
survey (red diamonds) and the Chandra only data (green 
diamonds ). The difference in these surveys gives us a 
rough measure of the systematic error, since these are 
completely different samples. 

We modelled the log N-log S distribution using the 
following method. First to be consistent with previous 
large angle surveys we adopted t he Schechter function 
as outlined in the review bv lRosati. Borgani. fc Normanl 
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Fig. 12. — Identical plot to the previous plot except in differential form. 



I200I 



dN 



= <P_lLx\ (-L./LZ) (-L: min /L w ) 

dL x L*\L*J 

where we added the last factor to possibly cut off the 
luminosity on the low luminosity end to account for small 
groups that will not even emit X-rays even if they obey 
scaling relations. We used the values of a — 1.7 (has 15% 
scatter in the literature), L* = 4 x 10 44 ergs s _1 , </>* = 
2.7 x 10~ 7 Mpc~ 3 (has 50% scatter in the literature) from 
iRosati, Borgani, fc Normanl I2QQ2I which summarize the 
previous work at the high flux end. We also used slightly 
lower values of a = 1.6 and 0* = 1.8 x 10 -7 Mpc -3 for 
a second calculation than the mean values measured in 
other work but still within the observational uncertain- 
ties to be conservative about the measurement of evolu- 
tion as we discuss below. We then use this luminosity 
function and the comoving volume element of the Uni- 
verse for ACDM (ft m = 0.25, ^ A = 0.75) to populate the 



Universe with clusters as a function of redshift. Then, we 
convert the luminosities of the clusters using the appro- 
priate luminosity distance for the same cosmology. We 
also predict the fraction of these clusters that would be 
missed by our size cut by estimating the physical size, R 
of the cluster following the relation of 



R = IMpc 



10 46 ergs s~ 



0.45 



We determined this relation by looking at the clusters in 
our sample where we knew the redshift. This physical 
distance is converted into an angular size using the an- 
gular diameter distance for the same cosmologies. This 
allows us to predict the log N-log S distributions in Fig- 
ure 11. 

We varied L* x min between 2 x 10 41 ergs s" 1 and 2 x 
10 42 ergs s _1 which had an minor effect on the curves. 
A luminosity of 2 x 10 42 ergs s _1 would correspond to 
a object with virial temperature around 400 eV and a 
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Fig. 13. — The survey area in square degrees as a function of flux for different source sizes (4 (left most) to 256 (right most) arcseconds 
in multiples of 2). The blue curves are the overlap survey and the red curves are the part of the survey where there is only Chandra data. 
The Chandra-only survey has a larger area, but does not go as deep as the XMM/Chandra overlap survey. The dotted curve shows the 
area after the 4 arcminute central pointing region is excluded. 

luminosity of 2 x 10 41 ergs s _1 would correspond with 
a virial temperature around 100 eV. Even if there are 
many objects in the Universe with these sizes they would 
stop emitting significant X-rays somewhere in this range 
since the flux is located out of the X-ray band in this 
temperature range. 

Similarly, the cosmological parameters used in the cal- 
culation of the volume element have a small effect as 
well. We adopted the concordance values of Q m = 0.25, 
= 0.75, and Ho = 72kms _1 Mpc _1 . If we assume no 
evolution in this luminosity function then the data are 
completely inconsistent with the model using the nomi- 
nal set of parameters for (/>*, L* mm , and a as well as a 
conservative set of values for these parameters as shown 
in Figure 11. On the other hand, if we modify 0* by 
a factor of (1 + z) q we can match the data only if q is 
greater than -2. This demonstrates that we have mea- 
sured the onset of cluster formation and have observed 



the dearth of clusters at high redshift. Since the 1 + z 
formulation shows rapid evolution, we have also simply 
tried to represent this rapid evolution as a e _i o instead. 
In Figure 11, we plot values of zq of 0.7,1.0, and 1.3 along 
with the no evolution case. Given the parameters of the 
model we have used, the data points towards rapid evolu- 
tion at relatively low redshift of z=0.7 to 1.3. The exact 
value may depend somewhat on our choice of param- 
eters for the normalization and slope of the luminosity 
function. It also might be affected by unaccounted for in- 
efficiencies of the survey which could reduce the number 
of observed clusters by 10 to 20%. Note, however, that 
the size cut only mildly reduces the number of clusters 
and that cut would have to be falsely removing clusters 
as large as 20 arcseconds instead of 4 arcseconds to ex- 
plain the evolution. To demonstrate this, we show the 
effect of a zero arcsecond and eight arcsecond cut in Fig- 
ure 13. Generally, there is little effect below a flux of 
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Eisenstei rTfc Hul 1999 in an identical calculation to that 
of lHaiman et al . 2001 for three different mass of clusters. 



Fig. 14. — The log N-log S distribution for the cluster candidates showing the effect of the size cut. We plot the same curves as in Figure 
11, but we show both a size cut of zero arcseconds (lower curve) and eight arcseconds for the same model parameters as before except we 
only include zo = 1.0 for clarity. Clearly, the size cut has very little effect above fluxes of 10 -14 ergs cm -2 s _1 . 

10 -14 ergs cm -2 s _1 . For that reason, it is very likely 
that the observed deficit at low fluxes is due to an ac- 
tual absence of clusters at high redshift. Note also that 
our exponential function describes the number density of 
clusters and not the absolute number of clusters which 
will peak at higher redshifts. 

This rapid evolution is generically predicted in mod- 
els of structure formation with a non-negligible amount 
of matter in the Universe. Structure formation models 
predict a decrease in both the normalization of the clus- 
ter mass function at high redshift as well a s a change 
in shape of the function ([Jenkins et al.|[200TI ). Further- 
more, if there is significant cluster evolution as we have 
argued, then the parameterization of evolution in terms 
of (1 + z) q or e _I o is an oversimplification. In Figure 
14, we plot the range of exponential functions derived 
to match the log N-log S distribution. We then overlay 
the expected numb er density of clusters combining the 
I Jenkins et al.l l200ll theory with the power spectrum of 



We use the concordance parameters above and also set 
a 8 = 0.75, Q b 0.04, w = -1, and n s 1.0. The 
agreement clearly demonstrates that the rapid evolution 
that we have measured is similar to what is expected 
for reasonable cluster masses. Future detailed measure- 
ments and modelling when masses and redshifts can be 
estimated may measure the exact nature of the cluster 
evolution. 

The structure formation model predicts an evolution 
that varies as a function of mass, so we can predict the 
log N-log S distribution directly as an additional check. 
We must make the additional assumption of a mass- 
luminosity relation and assume that the theoretical mass 
function agrees well with the local lumino sity function. 
We adopt the mass-luminosity relation of IStanek et al.l 

120061 where L44 = 3.8 ( 10 i^ h -i ) • We ignored any red- 
shift dependence of the relation to be conservative, since 
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Fig. 15. — The expected number density of clusters compared to the present number density. The blue dashed region is the region 
enclosed by our measured exponential function. The three curves correspond to the number density for cluster of mass of 3 x 10 13 , 10 14 , 
and 3 x 10 14 solar masses (from right to left) using the calculation described in the text from structure formation theories for concordance 
cosmology. The exact shape of the rapid turnover is not important, but these curves demonstrate that structure formation theories predict 
a similar rapid evolution in the number density close to the observations. 



the change in the relation would make the evolution more 
significant. We also assume that simulated clusters with 
calculated luminosities below 2 x 10 42 ergs s _1 are sup- 
pressed by the same exponential function as before. The 
direct log N-log S calculation is shown in Figure 15, and 
agrees quite well given the uncertainties in our mass cal- 
ibration. 

Our results are generally consistent with interme- 
diate flux ROSAT surveys at 10~ 13 ergs cm 2 s - 1 as 
measured in iHenrv et al. l l200lL iRosait et al.lll998l and 
IVikhlinin et al.lll998l . Near a flux of 10 -14 ergs cm 2 s _1 
ou r results deviat e som ewhat particularly with those 
of IVikhlinin et all H998 and are closest to those of 
Henr y et al. 1 12001 . A combination of some non-cluster 



contamination and an underestimate of the survey sen- 
sitivity could explain these discrepancies. Considering 
the error ranges, however, these measurements could still 



be consistent with some mild cluster evolution. Our 
results are compleme ntary to those at high fluxes, e.g. 
Bohringe r et al.l l200L and we have fixed our theoretical 
model consistent with those results. 

Our results are also broadly consistent with the recent 
work of Vikhlinin where cluster masses have been esti- 
mated for a sample of 36 clusters IVikhlinin et al.l fl998. 
In this work, he demonstrated that a lower redshift sam- 
ple (z ~ 0.1) had a different and lower normalization 
mass function than the cluster at higher redshift z ~ 0.5. 
The results were consistent with the concordance struc- 
ture formation model discussed above. In a later work, 
Vikhlinin measure d the equation of stat e parameter for 
dark energy to 10% IVikhlinin et aTll2008l . Since our sam- 
ple is larger and deeper, it might be possible to measure 
the equation of state parameter to 2% in a different mass 
and redshift regime. 
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Fig. 16. — The log N-log S distribution as before, but using the direct calculation of the distribution from structure formation theory. 
See the text for further details. 



Our results are inconsistent, however, with the log N- 
log S distribution determined using the Chandra deep 
field (|Bauedl2QQ2h . This was determined by using 130 
square arcminutes and finding six candidates clusters in 
that survey that suggested there was no evolution in 
the cluster luminosity function. These objects, however, 
have signal to noise less than 1 (table 1, column 6 of their 
paper), so would not survive our signal to noise cut. If 
we remove our signal to noise cut we find 3 out of the six 
objects. If only two or three are clusters, then we would 
be consistent with th is m easurement. Similarly , both 
iMcCardv et al. 1 119981 and iGiacconi et~al~l l2QQ2l found 
clust ers in deep and narrow surveys. IGiacconi et al. I 
120021 do not make a claim about cluster number den- 
sity. If we convert their data to log N-log S as in 
iRosati, Borgani, fc Norma n 2002, then they are consis- 
tent with our measuremen ts at 10~ 15 ergs s -1 b ut in- 
consistent at lower fluxes. iGiacconr et al. 1120021 argue, 
however, that many of these sources could be contami- 
nated by hard X-ray emission from AGN. 



Our results are consistent with previous measure- 
ments of the evolution of the luminosity functions 
where the redshifts were known in the Rosat Deep 
Cluster Surv ey and the Einstein Medium Sensitiv- 
ity Surveys (|Gioia et all [T 990 and see Figure 9 in 
iRosati. Borgani. fc Normanl 12002). although the inter- 
pretation is somewhat different. There the authors con- 
sidered both an evolution in the normalization and the 
value of L* x (in their notation the parameters are A and 
B). In our case, the evolution factor, q, is equivalent to 
A + B. The previous results have constrained the value 
of A + B to be approximately equal to —3, which is also 
completely consistent with our result. The other authors 
have argued, however, that the data are more consistent 
with mostly a non-zero value of £?, which we cannot dis- 
tinguish from the flux distribution alone. In addition, our 
sample is much deeper and it is likely that the Schechter 
formulation is not even correct to such a large range in 
redshifts. In any case, negative evolution in the clusters 
x-ray luminosity function and the emergence of clusters 
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in the Universe has been clearly observed. Additional 
detailed photon Monte Carlo simulations like those we 
have proposed above as well as optical follow-up to re- 
move spurious sources are needed to verify the efficiency 
and contamination we have claimed in the log N-log S 
calculation. However, the large deviation of the Log N- 
Log S curve at 10 -14 ergs cm -2 s _1 compared to the 
future model is beyond any reasonable expected system- 
atic error. 

6. FUTURE WORK 

There are several research areas that can be pursued in 
the future with this cluster sample. First, the efficiency 
of the survey can be carefully calculated by futher Monte 
Carlo simulations. Second, the bias of the survey can be 
more accurately estimated by performing a blind survey 
on a modest field. Third, the redshifts of the new clusters 
can be measured by optical and IR follow-up including 
cluster verification. Fourth, the cluster abundance can be 
measured by combining the efficiency calculation and the 



redshifts. Fifth, the X-ray estimated cluster masses can 
be compared to weak lensing measurements for at least 
some of the clusters. Then by assigning these clusters to 
a given mass and redshift, the structure formation theory 
can be tested in great detail. Finally, the XMM-Newton 
and Chandra archive continues to accumulate more data, 
so a much larger survey can be performed in the future. 
Some portion of this additional work is necessary prior 
to a robust estimate systematic errors of cosmological 
parameters. 
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